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AMENDMENTS TO THE CLAIMS: 
Please cancel claim 13 without prejudice or disclaimer. 

1 . (Currently amended) A computer-implemented m ethod for identifying relationships 
between text documents and structured variables pertaining to said text documents, 
comprising: 

generating a dictionary of keywords in said text documents; 
forming categories of said text documents using said dictionary and an automated 
algorithm; 

counting occurrences of said structured variables, said categories* and combinations of 
said structured variables and said categories for otructurod voriablo/cat e gnTy nnrnhnintir i w a-w 
said text documents; and 

calculating probabilities of occurrences of said combinations of structured variables 
and categories to identify a relationship between said text documents and said structured 
variables s tructur e d variable/category combinations 

2. (Original) The method according to claim 1, wherein said algorithm comprises a 
keyword occurrence algorithm and wherein each of said categories comprises a category of 
text documents in which a particular keyword occurs. 

3. (Original) The method according to claim 1, wherein said algorithm comprises a 
clustering algorithm and wherein each of said categories comprises a category of said text 
documents containing a particular cluster. 

4. (Original) The method according to claim 3, wherein said clustering algorithm 
comprises a k means algorithm. 

5. (Previously presented) The method according to claim 3, wherein said forming 
said categories comprises inputting a predetermined number of categories. 

6. (Previously presented) The method according to claim 2, wherein said forming 
said categories comprises: 

PAGE 3/13 * RCVD AT 4/26/2005 12:40:20 PM [Eastern Daylight Time] ' SVR:UBPTO-EFXRF-1/2 ' DN18:8729306 * CSID: 70376 12375 * DURATION (mm-ss): 03-22 



04/26/2005 12:40 FAX 7037612375 



McGinn&Gibb.PLLC 



@004 



Serial No. 09/837,158 3 
Docket No. ARC92000001 8US 1 

generating a sparse matrix array containing a count of each of said keywords in each 
of said text documents. 

7. (Previously presented) The method according to claim 1, wherein said keywords 
comprise at least one of words and phrases which occur a predetermined number of times in 
said text documents. 

8. (Original) The method of claim 1, wherein said calculating probabilities comprises 
using a Chi squared function. 

9. (Original) The method of claim 1, wherein said generating a dictionary of keywords 
comprises: 

first parsing text in said text document to identify and count occurrences of words; 
storing a predetermined number of frequently occurring words; 
second parsing text in said text documents to identify and count occurrences of 
phrases; and 

storing a predetermined number of frequently occurring phrases. 

10. (Original) The method according to claim 9, wherein said frequently occurring words 
and phrases are stored in a hash table. 

1 1 . (Original) The method according to claim 6, wherein said generating a sparse matrix 
array comprises: 

third parsing text in said text documents to count a number of times that each of said 
keywords occurs in each of said text documents. 

12. (Currently amended) The method according to claim 1 , wherein said relationships 
comprise said combinations of structured variables and categories structur e d 
variablo/oatogory combinations having a lowest probability of occurrence. 

13. (Canceled) 
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14. (Currently amended) A computer-imrjlemented method for identifying relationships 
between text documents and structured variables pertaining to said text documents, 
comprising: 

generating a dictionary of keywords m said text documents: 
forming cat egories of said text documents using said dictionary and an automated 
algorithm: 

counting occurrences of said structured variables, said categories, and combinations of 
said structured variables and said categories for said text documents: and 

calculating probabilities of occurrences of said combinations of said structured 
variables and categories, said probabilities comprising Tho method aoonTding tn r.l mm-l- 
whoroin said method calculat e s a probability that a given co-occurrence of a structured 
variable and a category would have occurred as a purely random event. 

15. (Original) The method according to claim 1 , wherein said structured variables 
comprise predetermined time intervals. 

1 6. (Original) The method according to claim 1 5, wherein said predetermined time 
intervals comprise one of days, weeks, months and years. 

1 7. (Currently amended) A system for identifying relationships between text documents 
and structured variables pertaining to said text documents, comprising: 

an input device for inputting text documents; 

a processor for forming categories of said text documents and counting occurrences of 
said structured variables, categories* and combinations of said structured variables and 
said categories structur e d variable/category combination and calculating probabilities of 
occurrence of said combinations of structured variables and categories to identify a 
relationsh ip between said text documents and said structured variables structured 
variable/category combinations; and 

a display for displaying said probabilities. 

18. (Currently amended) The system according to claim 17, further comprising: 

a memory for storing occurrences of said structured variables, categories* and 
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combinations of structured variables and categories stoe ter e d variabl e /category cnmhrnntinnfi 
and probabilities of occurrences of said combinations of structured variables and categories 
structured variable/category combination s. 

1 9. (Original) The system according to claim 17, wherein said structured variables 
comprise predetermined time intervals. 

20. (Original) The system according to claim 19, wherein said predetermined time 
intervals comprise one of days, weeks, months and years. 

21 . (Original) The system according to claim 17, wherein said system calculates a 
probability that a given co-occurrence of a structured variable and a category would have 
occurred as a purely random event. 

22. (Original) The system according to claim 17, wherein said relationships comprise 
statistically significant relationships. 

23. (Currently amended) A programmable storage medium tangibly embodying a 
program of machine-readable instructions executable by a digital processing apparatus to 
perform a method for identifying relationships between text documents and structured 
variables pertaining to said text documents, said method comprising: 

generating a dictionary of keywords in said text documents; 
forming categories of said text documents using said dictionary and an automated 
algorithm; 

counting occurrences of said structured variables, said categories^ and combinations of 
said structured variables and said categories for structur e d variablo/oatogmry oornhintinonn in 
said text documents; and 

calculating probabilities of occurrences of said combinations of structured variables 
and categories to identify a relationship between said text documents and said structured 
variables structur e d variable/category combinations . 
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